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PREFACE 

The Agriculture end Resources Inventory Surveys Through Aerospace Remote 
Sensing Is a multiyear program of research, development, evaluation, and appli- 
cation of aerospace remote sensing for agricultural resources, which began In 
fiscal year 1980. This program Is a cooperative effort of the U.S. Department 
of Agriculture, the National Aeronautics and Space Administration, the National 
Oceanic and Atmospheric Administration (U.S. Department of Commerce), the 
Agency for International Development (U.S. Department of State), and the 
U.S. Department of the Interior. 

The work which Is the subject of this document was performed within the Earth 
Resources Research Division, Space and Life Sciences Directorate, at the 
Lyndon B. Johnson Space Center, National Aeronautics and Space Administration. 
Under Contract NAS 9-1F300, personnel of Lockheed Engineering and Management 
Services Company, Inc., performed the tasks which contributed to the completion 
of this research. 
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1. INTRODUCTION 


It Is not possible to obtain precise early-season estimations of crop propor- 
tions solely by using Landsat data because there Is not enough accumulated 
Information from only one or two Landsat passes. For this reason, the follow- 
ing approach was proposed to estimate crop proportions early In the season 
(ref. 1). 

• a. Use a regresslon-baied prediction equation to obtain an a priori estimate 

for specific major crop groups. 

^ b. Modify this estimate using current -year Landsat and weather data. 

c. Break down major crop groups into specific crops by regression models. 

In this report, some preliminary results from the development and evaluation 
of appropriate models for the first portion of the proposed approach are 
presented. The developed models were tested and evaluated by using a data 
base for North Oakota, constructed from the published statistics compiled by 
the North Dakota Crop and Livestock Reporting Service. Because of the local 
data base used, the results are valid only for North Dakotj area crop propor- 
tion estimation. It Is believed that the same models can be applied to other 
regions of the United States and foreign countries. However, further tests of 
the models using data fron areas with significantly different environmental 
conditions are needed to confirm this belief. Although It has not been fully 
Investigated, most likely, the same models can be used In the third portion of 
the proposed approach, which Is the breakdown of major crop groups Into 
specific crops. 

% 

There are difficulties In using straight regression models In early-season 
crop proportion estimation. These difficulties and some possible solutions to 
them are examined In section 2. In section 3, new models aiming to solve 
these difficulties are proposed. Following the proposal of new models, the 
construction of the North Dakota data base for testing and evaluating the 
models Is discussed In section 4. Some preliminary modeling and predicting of 
results using the North Dakota data base are presented In sections 5 and 6, 
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respectively. Some Issues which were not Investigated In this study are 
discussed In section 7. 


I 
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Z. DIFFICULTIES IN USING STRAIGHT REGRESSION MOOELS IN PROPORTION ESTIMATION 


Consider the following standard general line** model: 

P • X 0 ♦ c 

n*l n *k k *1 n*l 

where 

P • a vector of proportions. 

X ■ the data matrix of explanatory variables. 

0 • a vector of unknown coefficients to be estimated from the sany>1e data, 
c • a vector of error terms. 

There exists at least three difficulties In applying the above model to crop 
proportion estimation problems. 

1. Because the dependent variable Is proportion, It assumes only values In 
the [0,1] Interval. The estimated or predicted values can be outside the 
Interval. This maker -he interpretation of the results difficult. 

2. The variance of pi Is: 

Var( Pi ) - P t (l - P 1 )/n 1 

where 

Pi « the population proportion. 

n^ • the sample size of the 1 th class. 

The condition of constant variance assumption In a standard general linear 
model Is no longer true except in the rather uninteresting case where all 
Pj 's are Identical. 

3. If some of the explanatory variables are highly correlated, then the 
application of the least-squares method to the data will yield unstable 
and Imprecise estimated coefficients. The high correlations among the 
explanatory variables (l.e., multlcol linearity) can cause exclusion of 
significant variables, which In turn, causes mlsspeclflcatlon of the 
model. A brief examination of these difficulties Is given In reference 2. 
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There are ways to solve the first difficulty, and some of the proposed solu- 
tions are by using the logit model, the logllnear model, and the probit trans- 
formations (refs. 3, 4, and 5). The variance stabilization technique can be 
used to transform .he data to make the variances constant (ref. 6). It Is 
argued that. If all the P^'s are within the Interval [u.2,0.8], then the 
second difficulty won't be a serious problem because the least-squares method 
Is relatively robust when the constant variance assumption Is violated (ref. 5). 

There are methods and procedures which have been proposed In dealing with the 
problem of multlcol linearity In the linear model. Most widely used are the f 

direct data augmentation method (ref. 7), the mixed estimation procedure 
(refs. 8 and 9), the Bayesian approach (refs. 10, 11, and 12), and the ridge 
regression procedure (refs. 13, 14, and 15). 

Simulation studies by persons from different professional fields have shown 
that, among these widely used methods or procedures, the ridge regression 
procedure yields the most encouraging results In dealing with the multlcol- 
llnearlty problem. The ridge regression estimator of 6 In the linear model Is 
defined as: 

0 (k) - (X'X + kI (c )“ 1 X' P 


where 

k ^ 0 Is the ridge parameter whose value Is to be determined. 

The p(k) is a biased estimate of a. The Induction of a small amount of bias 
In the estimation significantly reduces the variance o f the ridge estimator. 
This results In a smaller mean-squared error of the estimate than the ordinary 
least-squares estimate 

1 • (X'XfVv 

There are different methods proposed for determining the optimal values of the 
ridge parameter k. The use of ridge trace, empirical Bayes methods, and 

A 

methods based on p are among them. A survey of these methods and a simulation 
study on some of the Important ones are in references 2 and 16. 
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3. THE PROPOSEO MODELS 


To correct tho shortcoming! of the ordinary regression modol discussed In sec- 
tion 2, the use of the logit model for crop proport on estimation and the use 
of the ridge regression procedure for estimation of the regression coefficients 
are proposed. In the dichotomous case, the logit model can be written: 

or In the following transformed linear form: 


'l.t ■ ' < ' 9 . T-% • Vi.l.t * Vi.k.t * *l,t 


where 


Pi ,t * observed proportion of spring small grains of the 1 th stratum of 
year t. 

X 1*J.t “ v «1ue of the 1 th stratum of the jt* 1 Independent variable of year t. 
C 1 ,t " « rror term. 

The estimated proportions obtained from the logit model automatically satisfy 
the proportion definition, that Is: 


A A 

p 1,t * "l.t ‘ 1 

°s»i.t 5 1 
’l.t ■ 1 - Pl.t 

The logit model In the dichotomous case can be extended to the case where 
there are more than two major crop groups. In the extended case, there will 
be a - 1 equations If there are A > 2 major crop groups with 0 < p. <1 

and P1,t,l + Pl,t,2 + *** + Pi t t " ** All the equations are considered 
simultaneously when estimating the regression coefficients. 


and 

where 
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4. DATA 


A data set containing crop, weather, and economic data at both county and crop 
reporting district (CKO) levels In North Dakota from 1975 to 1980 was 
constructed for testing the models. The data were based upon the published 
statistics compiled by the North Dakota Crop and Livestock Reporting Service. 
The variables Included In the data set were planted acres; prices of various 
spring small grains for the current and previous years; total acres of 
Individual counties and CRD's; temperature and r reclpltatlon for April, May, 
and June; and some dumniy variables for temporal and spatial effects (refs. 17, 
13, 19, 20, 21, 22, 23, and 24). 

A listing of the data set Is given In appendix A. Listing of summary statis- 
tics and correlation matrices at both county and CRD levels, given In 
appendices B and C, provide some basic Information about the data set. 

The number of acres of spring small grains planted was calculated by summing 
the numbers of acres planted In spring wheat (excluding durum), durum, barley, 
oats, rye, and flax. The previous year's weighted price of spring small 
grains was obtained as a weighted average for spring mall grains, using the 
numbers of acres planted as weights. The proportion of spring small grains 
were calculated by dividing the total acres of spring small grains planted by 
the total acres of each county or CRD. 

Weather data were published by weather station rather than by county and CRD 
levels. To be consistent, the weather data were transformed from weather sta- 
tion to county and CRD levels. If weather data were puollshed by more than 
one station within a county, then th^ average temperature and precipitation 
were used as the weather data for the county. A few counties do not contain 
weather stations with published data. In this case, the averages of the data 
published by stations near a county In this category were used as the weather 
data for the county. 

The data subset at the county level has 265 obs -vatlons. The data subset at 
the CRD level contains 45 observations. 
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5. SOME PRELIMINARY MODELING RESULTS 

5.1 MOOELS 

The following three mode<» were tested at both county and CRO levels. 

1. (ML1) straight regression model (stepwise) 

(PSSG) i.t • 9 o ♦ jSi V 0CRBJ) i.t * V UPSSG >t.t-l + 

* i u (P£RAPL) 1t * 6 12 (PERMAY) 1t ♦ ^(PERJURE), >t ♦ « j>t 

where 

(PSSG)i ♦ • proportion of spring small grains of the 1 th unit of 

year t. 

DCRD1, •••.DCRD8 « dumny variables for spatial effect. 

( WPSSG) •§ — 1 * year's weighted price of spring small grains. 

(PERAPL) 1ft ■ April precipitations of the 1 th unit of year t. 

(PERMAY)^ t ■ May precipitations of the 1 th unit of year t. 

(PERJUNE) itt - June precipitations of the 1 th unit of year t. 

e 1 ,t * error term 

2. (ML2) logit model 

(PSSG) „ « , ? xp ( E L + e. 

I.t 1 + exp(E) e 1,t 

where 

E^r^JrjfOCRDj),^^, (WPSSG) 

* *10 <PSSS, 1.t-l * Y u ( p ERA p L)i t * v 12 (PERMAV) 1 t 

+ y 13 (perjune) 1 t 
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3. (ML3) linearized logit model 

(PSSG) . 

Y i,t “ log e i - (Psse!^ t 

* ‘■io< PSSG) i.t-i + “n< PERM ->).t * v (PERMAV >i.t 

. « 13 (PEIWUNE) j(t . c i>t 
5.2 MODELS AT THE COUNTY LEVEL 

The results from the computer runs of the three models at the county level are 
given In appendixes D, E, and F. The stepwise regression was tested for the 
purpose of getting some Information about the relationship between the 
dependent variable, the proportion of spring small grains, and the explanatory 
variables. As expected, the most Important factor In determining the current 
year proportion of spring small grains Is last year's proportion. This 
variable, together with the Intercept, explains approximately 94 percent of 
the total variation of the crop proportion. 

Other significant variables are last year's weighted price; precipitation for 
April, May, and June; and the dummy variable for CRD 5. All the significant 
variables explain approximately 96 percent of the total variation. 

The nonlinear logit model was fixed by using the NLIN procedure In the 
Statistical Analysis System (SAS). The Independent variables included in the 
model are those significant variables plus all the dummy variables for spatial 
effect. 

The Marquardt iteration method was used In the estimation of the coefficients. 
The Marquardt method of Iteration Is equivalent to performing the ridge 
regression procedure. It Is useful when some of the independent variables are 
highly corrtlated. 
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The plots of observed and estimated proportions against COUNTYCO by year are 
given In figures 5-1 through 5-5. The variable COUNTYCO Is used to identify 
counties and CRD's In a plot. Values 10, 20, • ••, 90 are for CR01, CR02, •»*, 
CR09. Values li und 97, for example, are for the first county of CR01 and the 
seventh county of CR09, respectively. The value 1 Is for state-level data. 

As can be seen from the plots, the estimated proportions are very close to the 
observed ones. 

The linearized logit model was fixed by using the GLM procedure In SAS. The 
Independent variables used In the nonlinear logit model were used in this 
model. These significant variables explain approximately 95 percent of the 
total variation. The observed and estimated proportions from this model were 
plotted In figures 5-6 through 5-10. The plots show similar results as those 
obtained from the nonlinear logit model. 

5.3 MODELS AT THE CRD LEVEL 

The results of computer runs for the three models at the CRO level are pre- 
sented in appendixes G, H, and I, respectively. At the CRO level, last year's 
proportion still plays an extremely Important role In determining the current 
year's proportion. The variable plus the Intercept explain approximately 
96 percent of the total variation. Other significant variables at this level 
are dummy variables for CR07, year. May and June temperatures, and April 
precipitation. All the significant variables explain 99 percent, which Is 
virtually all the total variation. 

Although stepwise regression models at county and at CRD levels did not pick 
tp an Identical set of Independent variables, the same set of Independent var- 
iables used In the logit models at the county level was used In the logit 
(both nonlinear and linearized) models at the CRD level for the following 
reasons. 

a. Both stepwise regression models picked up the most Important variable, 
last year's proportion. 

b. Computer programming Is convenient. 

c. Comparison of the same model at different data levels Is easy. 


5-3 



• •55 

1.59 

••*5 

9.M 

••35 

cn 

^ ••!• 
••25 

• •?• 

• •IS 

•• !• 

• ••5 



•••• ♦ 

ii it M"if 

COUNT YCO 

Noret n oas hioocn 

Figure 5-1.- The plot of observed and estimated proportions from the nonlinear logit model for year 1. 


OKIbiNAL PAGE S3 

OF POOR QUALITY 



ISTSCAL ANALYSIS SYSTEM 

r» s !kk stfg i? : 

«S! 


1412? r»IOAY« 


21 * IMI T 


• * 4 S • 


••AS 


• •55 • 




• •45 ♦ 


••44 ♦ 


•«3S ♦ 


Ml * 


••25 


• .?• ? 


• •IS • 




• •!• * 


• ••S « 
I 


NOTE t 


~ ~~ II *15 if 23**55 31 35 Sf O 4? St 55 Sf 43 4? T| ?S**Ti"lj*l?**ir*«5* 

COUNTYCO 

t 00S MAO MI5SIN6 VALUES 5 00S HlDOtH 


Figure 5-2.- The plot of observed and estimated proportions from the nonlinear logit model for year 2. 
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Figure 5-5.- The plot of observed end estimated proportions from the nonlinear logit model for year 5. 
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The observed and estimated proportions from the CRD logit (nonlinear) model 
are plotted against COUNTYCO by year In figures 5-11 through 5-15. The 
observed and estimated proportions from the linearized logit are plotted 
against COUNTYCO by year In figures 5-16 through 5-20. COUNTYCO 's at this 
level are 10, 20, 30, 40, 50, 60, 70, 80, and 90 for the nine CRO's In North 
Dakota. As can be seen from these plots, the estimated proportions almost 
duplicate the observed proportions In both linear and nonlinear logit models. 

5.4 SOME FINDINGS 

Estimated regression coefficients and their standard errors of the linearized 
logit model at both county and CRD level* are summarized In table 5-1. Some 
main findings from the model fittings follow. 

a. At both levels of the data, the most important factor determining current 
year crop proportion Is last year's proportion. 

b. At both levels of the data, other significant Independent variables are 
last year's weighted price of spring small grains and precipitation for 
April, May, and «Ajne. 

c. All three models which were tested performed well. 

d. Dumny variables for temporal and spatial effects are not very Important. 

e. The standard errors of estimation of the estimated coefficients at the CRO 
level are greater thar those at the county level. 

f. The R^ computed from CRO data is greater than that computed from county 
data, (In fact. It can be substantially greater.) 


TABLE 0-1.- ESTIMATED REGRESSION COEFFICIENTS AND THEIR STANDARD ERRORS 

OF THE LINEARIZED LOGIT MODEL 


Parameter 

Model l a 

Model 2^> 

Estimate 

Standard error 

— 

Estimate 

Standard error 

Intercept 

-2.5018 

0.0685 

-1.9112 

0.2324 

(WPSS6), >t _ x 

.0410 

.0170 

.1181 

.0272 

(DCRDl) 1ft 

.1159 

.0398 

-.0290 

.0683 

(DCRD2) 1>t 

.0569 

.0403 

.0274 

.0478 

(DCRD3) i>t 

-.0696 

.0409 

.1267 

.0926 

(DCRD4) i 

.0068 

.0478 

-.2928 

.1388 

(DCRD5) 1|t 

.0129 

.0377 

-.0377 

.0470 

(DCRD6) 1t 

-.1013 

.0454 

.1478 

.1109 

(DCRD7) ift 

.0273 

.0433 

-.2563 

.1358 

(DCRD8) 1>t 

.0403 

.0453 

-.2228 

.1224 

(PSSO^t.i) 

4.9286 

.1307 

3.0643 

.6743 

(PERAPR) 1t 

-.0322 

.0097 

-.0710 

.0147 

(PERMAY) 1>t 

-.0496 

.0085 

-.0266 

.0129 

(PERJUNE) 1jt 

.0241 

.0072 

.0107 

.0104 

R 2 

0 

.952981 

0 

.990776 


a Model 1, at county level: 248 completed observations 

b Model 2, at CRD level: 45 completed observations 
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Figure 5-11.- The observed and estimated proportions from the CRO 
logit model are plotted against COUNTYCO for year 1. 
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Figure 5-14.- The observed and estimated proportions from the CRD 
logit model are plotted against COUNTYCO for the year 4. 
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Figure 5-19.- The observed and estimated proportions from the linearized 
logit model are plotted against COUNTYCO for year 4. 
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6. SOME PRELIMINARY PREDICTION RESULTS 


6.1 THEORETICAL BACKGROUND 

By using the method of least squares to estimate e In the model y ■ xg + c , 
one may then attempt to predict the crop proportions In 1980. Let each row of 
the m by k matrix X* be the values taken by the Independent variables during 
the prediction period. If the assumptions of the fitted model are still valid 
for the m observations, that Is: 


Y* ■ X*0 ♦ e* 


then 


a. X # g Is the best linear unbiased predictor of 

E(T*) • X*g 


with covariance matrix 


^ . * o 2 [X.(X’*)' ! xl] 
X„6 


and 


b. X*g Is the best linear unbiased predictor of 

Y * * M + e* 

with covariance matrix 

2 * ° 2|:i k + x *(x’x) _1 x;] 

x*5 

See figure 6-1 for the simple regression case. 
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Simple regression with Y* being one dl mens 1 one I : 

Y » a ♦ UX ♦ e 

Given X*» then 

. . 

E(Y*|X*) • a* *X # 




b. 


with variance Var(»+ px*) • o 2 

?* ■ i ♦ K 

with variance Var(<* + flX*) • 0 2 


1 (X. - T) 

-i 4- 

n 


(X 1 - XT 


1 (X# * T) z 

1 7 

(x, - xr^ 


/ / 



Figure 6-1.- An Illustration of the simple regression case. 
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The 100(1 - a) percent confidence ellipsoids for E(Y*|x*) and Y* are as 
follows: 

a. X,e « N (n cx #0 ,o 2 x # (x , x)- 1 x;] 

Confidence ellipsoid for E(Y*|X*): 

*,<# - »)’[*. (x'xr'x;] (» - s)x; < ■?F a>n , ifl . k 

b. x.8 - X.8. ° 2 [i k + X.IX'X)'^;) 

Confidence ellipsoid for Y*: 

X.(8 - 8)'[1 ♦ X,(X'X)" 1 X;]" 1 (8 - 6)x; < ■?F (Mun . k 

The model data at the CRD level 
m ■ 9 

o 2 - 0.00409026 

^0. 05, 9, 31 * 3,03 

6.2 PRELIMINARY RESULTS 

The 1980 data used In 'he prediction are listed in appendix J. The notations 
of the data are the same as those listed In appendix A. 

Because all three models that were considered yielded equally good results, only 
the linearized logit model was used In the evaluation of predictability of the 
models. The 1980 Independent variable data, both at county and at CRD levels, 
are plugged into the fitted linear logit model to get the 1980 predicted crop 
proportions. In order to learn the predictability of the models, the predicted 
crop proportions a~e then compared with the 1980 proportions, which are calcu- 
lated directly from the published data. The predicted, calculated proportions 
and the differences at both county and CRD levels are presented in tables 6-1 
and 6-2, respectively; they are plotted ' n figures 6-2 and 6-3, respectively. 

The average squared error of prediction is 0.00352592 at the county level 
(52 counties) and 0.00063521 at the CRD level (9 CRD's). 
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TABLE 6-1.- THE PREDICTED AND CALCULATED PROPORTIONS AND 
THE DIFFERENCES AT THE COUNTY LEVEL 


Obser- 

vation 

County 


1 

Burke 

11 

2 

Divide 

12 

3 

Mount ral ! 

13 

4 

Renvl lie 

14 

5 

Ward 

15 

6 

W1 1 1 1 ams 

16 

7 

Benson 

21 

8 

Bottineau 

22 

9 

McHenry 

23 

10 

Pierce 

24 

11 

Rolette 

25 

12 

Cavalier 

31 

13 

Grand Forks 

32 

14 

Nelson 

33 

15 

Pembina 

34 

16 

Ramsey 

35 

17 

Towner 

36 

18 

Walsh 

37 

19 

Dunn 

41 

20 

McKenzie 

42 

21 

McLean 

43 

22 

Mercer 

44 

23 

Oliver 

45 

24 

Eddy 

51 

25 

Foster 

52 

26 

Kidder 

53 

27 

Sheri dan 

54 


Proportion 
of spring 
small grains 

Predicted 
proportions 
of spring 
small grains 

Difference in 
predicted and 
published spring 
small grain 
proportions 

0.323671 

0.307626 

0.016045 

.338221 

.335807 

.002414 

.269551 

.282295 

-.012744 

.451467 

.484650 

-.033182 

.370520 

.387766 

-.017245 

.290395 

.295436 

-.005041 

.388342 

.351792 

.036550 

.482633 

.432534 

.050099 

.290131 

.283453 

.006678 

.389722 

.402973 

-.013251 

.362644 

.348666 

.013978 

.607019 

.541898 

.065121 

.490699 

.379811 

.110888 

.449906 

.387692 

.062214 

.516709 

.408852 

.107858 

.481020 

.393475 

.087545 

.553392 

.471408 

.081984 

.548212 

.456271 

.091941 

.146680 

.166588 

-.019908 

.125971 

.147770 

-.021799 

.358202 

.340999 

.017203 

.206484 

.207201 

-.000717 

.219747 

.202496 

.017251 

.340059 

.261390 

.078669 

.402374 

.350540 

.051834 

.201008 

.210422 

-.009414 

.327825 

i 

.307767 

.020058 
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TABLE 6- 1 Concluded 


Obser- 

vation 

County 

H 

Proportion 
of spring 
small grains 

Predicted 
proportions 
of spring 
small grains 

Difference In 
predicted and 
published spring 
small grain 
proportions 


Stutsman 

55 

0.352943 

0.282379 

0.070563 


Wells 

56 

.449745 

.426989 

.022756 

30 

Barnes 

61 

.507416 

.416293 

.091123 

31 

Cass 

62 

.565323 

.427776 

.137547 

32 

Griggs 

63 

.441461 

.315426 

.126036 

33 

Steele 

64 

.551937 

.416136 

: : I 

34 

Traill 

65 

.577272 

.499878 


35 

Adams 

71 

.251833 

.228150 

.023683 

36 

Billings 

72 

.068591 

.129781 

-.061191 

37 

Bowman 

73 

.211538 

.209436 

.002103 

38 

Golden Valley 

74 

.158253 

.175076 

-.016823 

39 

Hettinger 

75 

.373402 

.362535 

.010867 

40 

Slope 

76 


.171852 


41 

Stark 

77 

.280680 

.263685 

.016996 

42 

Burleigh 

81 

.250000 

.228753 

.021247 

43 

Emmons 

82 

.301065 

.306140 

-.005075 

44 

Grant 

83 

.188607 

.184216 

.004390 

45 

Morton 

84 

.188639 

.185074 

.003566 

46 

Sioux 

85 

.093495 

.125946 

-.032451 

47 

Dickey 

91 

.327127 

.259582 

.067545 

48 

La Moure 

92 

.439178 

.394666 

.044512 

49 

Logan 

93 

.303134 

.288294 

.014841 

50 

McIntosh 

94 

.393775 

.335847 

.057928 

51 

Ransom 

95 

.337907 

.260638 

.077268 

52 

Richland 

96 

.334958 

.276545 

.068412 

53 

Sargent 

97 

.380825 

.275197 

L. 

.105628 
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TABLE 6-2*- THE PREDICTED AND CALCULATED PROPORTIONS AND THE DIFFERENCES AT THE CRD LEVEL 


Obser- 

vation 

CRD 

CRD 

code 

Proportion 
of spring 
small grains 

Observed 

log 

odds 

Estimated 

log 

odds 

Predicted 
proportions 
of spring 
small grains 

Difference in 
predicted and 
published spring 
small grain 
proportions 

1 

Northwest 

10 

0.330254 

-0.7070 

-0.6140 

0.351147 

-0.020893 

2 

North Central 

20 

.381331 

-.4839 

-.4283 

.394530 

-.013199 

3 

Northeast 

30 

.524448 

.0979 

-.0371 

.490/32 

.033716 

4 

West Central 

40 

.204559 

-1.3580 

-1.2315 

.225927 

-.021368 

5 

Central 

50 

.341577 

-0.6563 

-0.6241 

.348857 

-.007280 

6 

East Central 

60 

.533956 

.1360 

-.0642 

.483966 

.049990 

7 

Southwest 

70 

.215311 

1.2932 

-1.1661 

.237561 

-.022250 

8 

South Central 

80 

.209579 

-1.3275 

-1.2134 

.229103 

-.019523 

9 

Southeast 

90 

.360794 

-0.5719 

-0.6120 

.351599 ! 

1 

.009195 

L 




statistical analysis system 

(tit a smftsHRc. isat sus is : 

SIS! 


I5t20 VCDMESOAV. JULY 15. Ifil A 


0.S5 ♦ 


0.56 ♦ 


0.45 ♦ 


0.40 


• I 


I 


0.3$ * 


0.36 ♦ 


0.25 ♦ 


0.20 ♦ 


0.15 ♦ 


0.10 ♦ 


0.0S ♦ 
I 


*11 ir*i; 23 I? 31 35* 30**43**47 Sl**Ss~ S9 03**4? II* ?5 70 03**07**01**05* 


COUNTYCO 


NOTE! 1 OBS MAO MISSING VALUES 3 O0S HlOOEN 


Figure 6-2.- A plot of the predicted and calculated proportions 
and the differences at the county level. 
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Figure 6-3.- A plot of the predicted and calculated proportions 
and the differences at the CRD level. 



The covariance matrices of the best linear unbiased predictor X # g of E(Y # |x*) 
and Y* | X* At the CRD level are given In tables 6-3 and 6-4, respectively. 

Comparisons between 1980 published and predicted acreages at both county and 
CRO levels are given In tables 6-5 and 6-6, respectively. At the county 
level, the published 1980 total planted acreage of spring small grains Is 
14,857,300 acres (excluding Slope County, North Dakota). The predicted 1980 
planted acreage Is 13,504,169.67 acres (excluding Slope County). The model at 
the county level underpredicted the 1980 planted acreage by 1,353,130 acres or 
9.1 percent of the published data. 

At the CRD level, the published data are 14,980,000 acres; the predicted 
planted acreage Is 15,117,152.11 acres. The model at the CRD level over- 
predicted the 1980 planted acreage by 137,151.11 acres or 0.92 percent. In 
terms of the absolute differences between the 1980 published and predicted 
acreages, the model at the CRO level yields better prediction than the model 
at the county level when North Dakota data were tested. A summary of the 
predictability of the models In terms of planted acreage Is in table 6-7. 
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T.'.BLE 6-3.- ESTIMATE OF THE COVARIANCE MATRIX OF E(T.|X*) 


Covariance 

Matrix 

Column 1 

Column 2 

Column 3 

Coluan 4 

Column 5 

Co!uan 6 

Coluan 7 

Column 8 

Column 9 

Row 1 

0.0173474 

0.0248336 

0.0267834 

0.0277973 

0.0370541 

0.0200517 

0.0248632 

0.0278089 

0.0314446 

Row 2 

.0248336 

.0694118 

.0814133 

.0746374 

.0955308 

.0565694 

.0911771 

.0866872 

.0897132 

Row 3 

.0267834 

.0314133 

.0986158 

.0883136 

.112781 

.0673055 

.110917 

.104425 

.107225 

Row 4 

.0277973 

.0746374 

.0883136 

.0822022 

.104522 

.0616231 

.0981924 

.0941519 

.0977423 

Row 5 

.0370541 

.0955308 

.112781 

.104522 

.138987 

.0793024 

.123051 

.119782 

.125941 

Row 6 

.0200517 

.0565694 

.0673055 

.0616231 

.0793024 

.0473392 

.0753858 

.0717396 

.0745453 

Row 7 

.0248632 

.0911771 

.110917 

.0981924 

.123051 

.07S2958 

.131003 

.119152 

.119775 

Row 8 

.0278089 

.0866872 

.104425 

.0941519 

.119782 

.071739C 

.119152 

.11236 

.114719 

Row 9 

.0314446 

.0897132 

.107225 

.0977423 

.125941 

.0745453 

.119775 

.114719 

.118841 



TABLE 

6-4.- ESTIMATE OF THE 

COVARIANCE MATRIX OF fjX # 



Covariance 

matrix 

Column 1 

Coluan 2 

Coluan 3 

Coluan 4 

Column 5 

Coluan 6 

Coluan 7 

Column 8 

Coluan 9 

Row 1 

1.01735 

0.0248336 

0.0267834 

0.0277973 

0.0370541 

0.0200517 

0.0248632 

0.0278089 

0.0314446 

Row 2 

.0248336 

1.06941 

.0814133 

.0746374 

.0955308 

.0565694 

.0922771 

.0866872 

.0897132 

Row 3 

.0267834 

.0814133 

1.09862 

.0883136 

.112781 

.0673055 

.110917 

.104425 

.107225 

Row 4 

.0277973 

.0746374 

.0883136 

1 .08226 

.104522 

.0616231 

.0981924 

.0941519 

.0977423 

Row 5 

.0370541 

.0955308 

.112781 

.104522 

1.13699 

.079 ‘*024 

.123051 

.119782 

.125941 

Row 6 

.0200517 

.0565694 

.0673055 

.0616231 

.0793024 

1.04734 

.0753858 

.0717396 

.0745453 

Row 7 

.0248632 

.0911771 

.110917 

.0981924 

.123051 

.0753858 

1.131 

.119152 

.1199775 

Row 8 

.0278089 

.0866872 

.104425 

.0941519 

.119782 

.0717396 

.119152 

1.11236 

.114719 

Row 9 

.0314446 

.0897132 

.107225 

.0977423 

.125941 

.0745453 

.119775 

.114719 

1.11884 




TABLE 6-5.- THE 1980 PUBLISHED AND PREDICTED ACREAGE OF COUNTIES 


County 


County 

code 


Mountral 1 
Renvl 1 1e 


HI 1 Hams 


Bottineau 

McHenry 

Pierce 

Rolette 

Cavalier 

Grand Forks 


Pembl na 


McKenzie 

McLean 

Mercer 

Oliver 


E 


Kidder 

Sheridan 


Spring small 
grains, pre- 
dicted acreage 

Spring small 
grains, pub- 
lished acreage 

220 309 

231 800 

279 391 

261 400 

328 637 

313 800 

274 816 

256 000 

507 259 

484 700 

390 259 

383 600 

315 881 

348 700 

464 230 

518 000 

340 869 

348 900 

267 703 

258 900 

203 733 

211 900 

524 384 

587 400 

349 547 

451 600 

246 882 

286 500 

294 112 

371 700 

314 276 

384 200 

314 674 

369 400 

375 529 

451 200 

212 380 

187 000 

258 857 

220 500 

450 665 

473 400 

138 178 

137 700 

93 440 

101 400 

106 229 

138 200 

144 703 

166 100 

182 882 

174 700 

194 804 

207 500 

















TABLE 6-5.- Concluded 


Obser- 

vation 

Cou.ity 

County 

code 

Spring small 
grains, pre- 
dicted acreage 

Spring small 
grains, pub- 
lished acreage 

28 

Stutsman 

55 

409 156 

511 400 

29 

Wells 

56 

354 982 

373 900 

30 

Barnes 

61 

394 046 

480 300 

31 

Cass 

62 

478 835 

632 800 

32 

Grl ggs 

63 

143 329 

200 600 

33 

Steele 

64 

189 092 

250 800 

34 

Traill 

65 

275 453 

318 100 

35 

Adams 

71 

144 410 

159 400 

36 

Billings 

72 

94 605 

50 000 

37 

Bowman 

73 

156 826 

158 400 

38 

Golden Valley 

74 

113 618 

102 700 

39 

Hettinger 

75 

263 113 

271 000 

40 

Slope 

76 

134 732 


41 

Stark 

77 

222 086 

236 400 

42 

Burleigh 

81 

237 904 

260 000 

43 

Emmons 

82 

294 482 

289 600 

44 

Grant 

83 

196 419 

201 100 

45 

Morton 

84 

227 418 

231 800 

46 

SI oux 

85 

88 908 

66 00C 

47 

01 ckey 

91 

189 889 

239 300 

48 

La Moure 

92 

286 938 

319 300 

49 

Logan 

93 

184 692 

194 200 

50 

McIntosh 

94 

213 223 

250 000 

51 

Ransom 

95 

143 622 

186 200 

52 

R1 chland 

96 

256 457 

319 900 

53 

Sa rgent 

97 

150 236 

207 900 

TOTAL 

13 638 902 

14 857 300 
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TABLE 6-6.- THE 1980 PUBLISHED AND PREDICTED ACREAGE BY CRD 


Obser- 

vation 

CRD 

CRD 

code 

Total 

CRD 

area, 

ml 2 

Spring small 
grains, 
published 
acreage 

Spring small 
grains, 
predicted 
acreage 

Difference In 
predicted and 
published 
spring small 
grain acreage 

1 

Northeast 

10 

9 232 

1 951 300 

2 074 747 

-123 447 

2 

North Central 

20 

6 910 

1 688 400 

1 744 771 

-58 371 

3 

Northeast 

30 

8 646 

2 902 000 

2 715 434 

186 566 

4 

West Central 

40 

8 555 

1 120 000 

1 236 995 

-116 995 

5 

Central 

50 

7 190 

1 571 800 

1 605 299 

-33 499 

6 

East Central 

60 

5 509 

1 882 600 

1 706 349 

176 251 

7 

Southwest 

70 

7 987 

1 100 600 

1 214 337 

-113 737 

8 

South Central 

80 

7 817 

1 046 500 

1 146 173 

-9 763 

9 

Southeast 

90 

7 435 

1 716 800 

1 673 047 

43 753 


TOTAL 


69 281 

14 980 000 

15 117 152 

-137 152 


TABLE 6-7.- SUMMARIES UF THE PREDICTABILITY OF THE MODELS 


Item 

Model 1 

Model 2 

Published 1980 

a 14 857 300 

14 980 000 

planted acreage 



Predicted 1980 

a 13 504 170 

15 117 152 

planted acreage 



D1 fference 

1 353 130 

-137 152 

% difference 

9.1 

-0.92 


Excluding Slope County. 
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7. ISSUES AND REMARKS 


The development and tests of the proposed models were based upon the usual 
assumptions of standard general linear theory*, l.e., normality, constant 
variance, and Independence of the error term. There Is no doubt that the 
standard theory must be modified to suit the conditions of the real problem. 

In this preliminary study, possible modifications to the general model were 
Investigated during the process of modeling. However, It was decided that 
modifications would not be Incorporated In the proposed models for :he 
following reasons. 

a. The assumptions of normality and of constant variance probably will not be 
serious problems In the proposed model since It has been shown to be 
relatively robust. This Is especially true for the assumption of constant 
variances. If all the observations are within the range 0.2 < P < O.R. If 
modifications to the proposed models are desired In order to satisfy the 
two assumptions, transformations of the data (discussed in Chapter 10 of 
reference 2) can be used. 

b. The Independence assumption concerning the dependent variable or error term 
Is the most difficult problem to deal with. Violations of the assumption 
also have the most serious consequences In statistical Inferences. Note 
the following quote by Scheffe (ref. 6). "In general, of the three kinds 
of possible departures from assumptions we have considered, those caused by 
lack of Independence are the most formidable to cope with." 

In the proposed models. It was assumed that the error terms are Independently, 
identically, and normally distributed with zero mean and unit variance. This Is 
the simplest assumption one can make. Because the combination of time-series 
and cross-section data was used In the proposed model, there are temporal and 
spatial correlations In the error terms. The Inclusion of lagged dependent 
variables In the models further compounds the problem because the crop propor- 
tions of a county or a CRD are certainly affected by previous-year proportions 
of the particular county or CRD and the proportions of surrounding counties or 
CRD’s. To reflect the conditions of the real problem, a covariance structure of 
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temporal effect and spatial effect should be developed according to the real 
situations. This Is an area where further research is needed. 

The dependence problem Is somewhat alleviated if one assumes that only tenoral 
effects exist and a county or a CRD Is uncorrelated with all other counties or 
CRD's for the current and previous years. Under this assumption (l.e., the 
error terms are autocorrelated), there are three main consequences If the 
standard general linear model Is applied. 

1. Estimates of a are unbiased, but the sampling variances of the estimates 
may be unduly large when compared with those achievable by a slightly 
different method of estimation. 

2. The sampling variances of the estimates are likely to be seriously under- 
estimated. 

3. Sampling variances of the predictors will be needlessly large. 

In addition to autocorrelated error terms, the proposed models also contain 
lagged dependent variables. In this case, there are other possible assump- 
tions about the error terms. Some of the possibilities are as follows 
(ref. 25): 


a. 


£ t.t - u i.t ; 0< 


and 

u i.t ~ "<°* ° u 2 > 

b. 


c i.t • v - * 0 < : 


and 

u i.t •“Vt-i ♦ "i.t 


with 

1 p f < 1 and ~ N(0,o^) 

c. 


e 1.t " pe 1,t-i + v 1,t 


with 

1 p 1 < 1 and v ~ N(0, o^) 

1 ,t v 
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If the error terms satisfy one of the above assumptions and there are lagged 
dependent variables Included In the models, then the ordinary least-squares 
estimates of 6 become biased and Inconsistent; l.e., the bias remains even If 
the sample Is large* The asymptotic bias Is a function of the true coefficient 
of the lagged dependent variable and the true autocorrelation coefficient 0 . 

It seems extremely hopeless to apply the standard general linear model theory 
to a study In which autocorrelated error terms and lagged dependent variables 
exist. There are encouraging results from studies In this area. It Is shown 
empirically and theoretically that If the absolute value of the true coeffi- 
cient of the lagged dependent variable Is large, then the asymptotic bl e 3 . will 
be small even with large values of the autocorrelation coefficient 0 (ref. 26). 
It Is further shown that the presence of Independent variables In a model 
tends to reduce the absolute values of the asymptotic biases (ref. 27). 

In the proposed stepwise models, the dependent variable Is the proportion of 
spring small grains. The true coefficient of the proportion 1 year before Is 
very big. (The least-squares estimates are 0.9288 and 0.9488 at county and 
CRD levels, respectively.) In addition to the lagged variable, there are 12 
Independent variables Included In the models. The large value of the true 
coefficient of the lagged dependent variable and the large number of Indepen- 
dent variables Included In the models are expected to make the asymptotic 
bUses In the estimates become minimal. 

It was proposed to use the ridge regression procedure to estimate the regres- 
sion coefficients In the logit models. There are two ways to Incorporate the 
procedure In the estimation and prediction of crop proportions. One way Is to 
specify the Marquardt Iteration method In the NUN procedure in SAS. 

Marquardt's method Is equivalent to performing ridge regressions. The other 
way Is to linearize the logit model and apply directly -he ridge regression 
estimation techniques to estimate the coefficients. 
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In this study, the first way was used In the nonlinear logit models. The 
ridge regression procedure was not applied to the linearized logit models for 
the following reason. Since the linear logit models fitted North Dakota data 
very well at both levels, little Improvement can be made by using methods 
other than the ordinary least- squares estimators. 

The alternative estimation technique to the ordinary least-squares method will 
be Investigated more thoroughly when the proposed models are tested with data 
from other areas. 
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APPENDIX A 


A LISTING OF THE CONSTRUCTED DATA SET 



APPENOIX A 


' LISTING OF THE CONSTRUCTED DATA SET 


The following symbol definitions are to be used In Appendixes A, B, C, D, E, 
F, G, H, I, and J of this document. 


Symbol definitions: 

AREA 
AREACRES 
BARLEY 
COUNTY CO 
CV 

OCR 01 through 0CR08 
OF 

DURUM 

0YR1 through DYR4 
FLAX 

LEVEL OR DATA LEVEL 
N 

OATS 

OBS 

P2CPSSG 

POBARLEY 

P DURUM 

PERAPL 

PERMAY 

PERJUNE 

POATb 

PPSSG1 

PSPW 

PSSG 

R2CPSSG 

RPSSG1 


Acres of a county or CRD 
640 times the area 
Acres of barley planted 
County code 

Coefficient of variation 

Dumn\y variables for spatial effect 

Degrees of freedom 

Acres of durum wheat planted 

Dumnty variables for temporal effect 

Acres of flax planted 

C represents county; CRD; ST represents state 
Number of observations 
Acres of oats planted 
Obse. /at Ion 

CRD-observed proportion 

Last year's price of barley 

Last year's price of durum wheat 

Precipitation for April 

Precipitation for May 

Precipitation for June 

Last year's price of oats 

Predicted proportion of spring small grains 

Last year's price of spring wheat without 
durum wheat 

Observed proportion of spring small grains 

PSSG - P^PSSG * R2CPSSG 

Difference obtained by subtracting the 
predicted proportion from observed proportion 
of spring small grains 
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RYE 

SPW 

SSG 

TACRES1 

TEMPAPL 

WPSSG 

YEAR 

Note: 


Acres of rye planted 

Acres of spring wheat other than durum wheat 
planted 

Total acres of spring small grains planted 

Pevlous year's total acres of the county 

TEMPMAY, TEMPJUNE Temperatures for April, May, and June, 

respectively 

Last year's weighted price of spring small 
grains 

1 - 1975, 2 » 1976, 3 - 1977, 4 - 1978, 

5 - 1979 

All variable names with the last letter 1 are the same as those 
without 1 with the exception that they are logged for one year. 
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—0 . 1 7S>0 
0.2*47 

O.OOtlbJ 

1.0000 

O.0B2I0 

0.3910 

0.00000 
1. 0000 

0.06130 

0.34*2 

-0.31009 

O,G370 

Of** 

0.1 i***) 
0 . **•* * 

0.00000 

1.0000 

0 • 0 16493 
0.9123 

O.0O0O0 

1.0000 

fi.tf 1 330 
0.4)03 

-0.30 3a 7 
O.Otfw* 

T£A6 

-0.?*2*3 
0 • OoO t 

ti.U00O4 
1 .0000 

-0.0332* 

0.7J t»5 

0.00000 

1.0000 

-0 • i/5 72* 
0.7066 

-0.0.'?** 

0.30*1 

S*-*i 

0. Iba 7b 
C.^bM 

-0.220J2 
0. 1*54 

0.63310 

0.0001 

—0.220 32 
0.1*b9 

0,60977 

0.0901 

-0.11073 

0.*69* 


0.fl»*l ) 

0.47*-, 

0.*3oM 

0.0016 

0.6*910 

0.9001 

0.*366l 

0.0016 

0.3*4b« 

0.0105 

0.22 36 7 
0.1397 

»a*c£ f . 

0.201*7 
0. Ia5* 

-0.2JJ69 

0.1223 

0.03)21 

OcObdl 

-0.2J369 

0.1223 

0.0637* 

9.4001 

-0.9*066 
0 . 7049 

OATS l 

O.So/Sl 

0.0011 

0.0112b 
U . 4* 1 5 

-0 • 32(522 
0.0277 

0.01123 

0.9*13 

-0.33660 

0.0230 

-0 • C 306 7 
6 ?*1 0 

«rll 

0.5*7/) 

O.OOr.J 

-o. lauG* 
u.2ibb 

0.06 723 
0.660 7 

-0.1000* 

0.2)66 

0.94055 

0.3196 

-0.07460 
C .6632 

£la*1 

1 . OftOuo 
0 . CufiO 

-0. 3b3bO 
g.Ci n 

°0?02h5 

-0 .33330 
0.0172 

0 • -2*09 
0 • OU 37 

0.2 3*2 J 
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asia; 

-0.33 b 
0.01 /2 

1 . ouOjO 

0 . 0 0 U 0 

0.03292 
0.0 100 
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-0**5900 

0.0015 
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t . *262 

SSGI 

0 . )<i 7 *b 

0 • 0 e -2 

0 . 0 J 2 <2 
u . a Juo 

1 .OOOuO 
0.0000 

)2<2 
w . o309 

(i . *6322 
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(t . y f 4oj 
0 .3022 

T aC*£ 5 1 

-0. 13 i^b 
C . 0 1 7 2 

1 .OOuuC 
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0 . O 1292 
0.6300 

1 .00000 
0.0000 

-0 • *5900 
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0.01*20 

0.4262 

P3SG1 

0 ** 2 * *W 
0.00 t 7 

-0 .*540« 
y .001 -a 

0.66322 

0.0001 

-#.*39*3 

0.0013 

1.U0900 
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•PSSO 

0 .?)* i 
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APPENDIX D 


OUTPUT OF COMPUTER RUN: STEPWISE REGRESSION MODEL AT THE COUNTY LEVEL 



STATISTIC* * N A L * s I S S V $ 1 £ » ISISJ TUESDAY. JCPC 23. INI 

survive rcorcssio* proceou re roa «H«Cnr var i aqle pssq 

0A**!*«qi It OQSEPva! IONS 0£t£T£0 uuE TO »ISS!«6 VALUES. 


SUP I VAA1A0L1 

PtSQl laciCPCO 


9 SQUARE • 

A.93*9*9il CIPI 

■ 

I**. 2*4 3A. 

;• 


or 

SU» 

or SQUARES 

^AAU SQUARE 


r 

PO«n*»r 

R» 

£•»*«** 

TulAt 

2*4 

2*4 


3.9*999*',* 

A.2S*0*:22 

*.22M3S4* 

J.HNmU 

A.AA1AS323 


370. 3* 

A.OAAI 


o value 


STD ERROR 

type II S$ 


r 

f«ON»f 

INTERCEPT 

PS sot 

A.»C^M4** 

A.*S*J^?S* 


A. A 1 ****** 

3.NW>N 


17*9*1* 


STEP 2 VARIAOlE 

•PS SO EnTCREO 


m '0»*A *E • 

A.9Sl*S2*l CIPI 

• 

A2.91SSA023 


or 

S&* 

or SOuARES 

•N.*^ SQUARE 


r 

Mon»r 

REGRESSION 

% *JlAL 

is 


*.A227T*73 

t:mssts 

2.111 i47J+ 
A.MAMI23 


2191. AA 

A. *§A1 


0 VAL* £ 


STO ERROR 

TYPE 11 SS 


r 

pRt>0>r 

INTERCEPT 

b^SSS 

PSSQ1 

A.AW2A*** 

§.**SSS4»* 


A.AA21723* 

MIN3S9S 

A.<J*2TAA2T 

j.uHtmj 


*2.7* 

*S3A.*A 

• •Mil 
A.MA4 

Sf£P 3 VA* ABLE 

PiPA*>N ENTERED 
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A.9S742N2A CIPI 

■ 

*3. 7t321 A 1 A 


or 

SUM 

OP «JUAO£S 

fCAM SQUARE 


r 
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K'MSSION 

K*lAL 

?** 


*.A*MV«*t 

A*1791*S2* 

*.22A«JS4* 

i.:***t»iAiA 

A.0AA73722 


IIM.*V 

•.§•#! 


B VALUE 


STO ENRON 

type il ss 


r 

p*<h»t 

INTERCEPT 

•^>SO 

Pt-iAPR 

-A.A*J«7J?* 

6.92N**ASA 


A.A«2STS9i 
A. At 3*ISt* 
O.tOl 73SJS 

0-O7AM>** 
3.S3A7S3** 
0.02fcl l*>*9 


i A*. 7 7 

A. MAI 
A.A6AI 
A.iiil 

STEP A ¥AAIAAc£ 

peprat EnTCRCD 


0 SQUARE * 

A.V*li*AVl CIPI 

. 

2f. *31 31773 


or 

Sun 

Of SQUARES 

ME AH SQUARE 


r 

p»Ci»r 

REGRESSION 
I rnun 
ToTal 

A 

2-2 

2*4 


*.A**77*1* 
A. 1*32S7S* 
*. 22*63*4* 

1.0lfcl**S3 
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IS**. 13 
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type ii ss 


r 

p»o«*r 

lH?t*»CtPT 
• ^SSO 

P*» »'*| 
fr*« wftP* 

♦'t rfNAY 

-0.0 l*7*27j 
3.0^21 ) J 2c 
0.92N?9S*l 
~0.AUi2**8 
-f ;<#e* 


A . AA24332J 

A. A 128 32NS 
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A.toUt 7*« 

• • II* 7V*S* 1 
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A.AOAt 

I.A6A1 

• .AM j 
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0 

1 

ro 
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STEPelSE REGRESSION PROCEDURE FOR OCPE«E*T VARIABLE PSSG 

STEP S VAJtIMLE KJtMC ENTERED A SQUARE * ClPY » 1I.ST39IIM 



or 

SON or SQUARES 

(CAN SOU APE 

r 

p«O0>r 

PC GUESS I ON 
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TOTAL 

id 
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• .•04017 
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•.•••1 
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STO ERROR 

TYPE It ss 

r 
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intercept 
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PEPAPP 
PtVHAf 
Pc « JUNE 

-IrJimiS? 

••92021096 
-•*•10761 IS 
-•.00765121 
•*••264973 

•*••167262 
•.••|4*99« 
• »•• 1 1-9041 

5 :U 2 T??SI 

•*•2769169 

•*•1069339 

•*••319213 

49.62 

S3G4.I5 

41.39 

2>«64 

4*61 

•*•••1 

O-iOOt 

•*•••! 


STEP A VAAIAHlE DCftCS ENTERED 

or 


REGRESSION 
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0 PSSG 

PsSGl 
PE ft APR 
PtwNAY 
Pt^JUNC 


10 
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•.••269903 


ft SOUAfcC • 
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4 .* 6996 » 6 » 

0.1 5 * 0550 « 
4.22ft03S6* 
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•.96261737 ClPI 

HE AH SOU A*E 


••67033ft 1 • 
•*••#•5054 
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__269 
..••166 L_. 

•.•$144436 

•.»«llft649 


TYPE II 5S 


•••3296666 

S . 02755250 

• • 1 M 9 S 01 
t.»«J4»992 


lft. 42 J'v« 2 ft» 

r 

;? 3«.«2 



wo 0 >r 

•.mi 

• .•••1 
•*•••1 
• .Off 
•••236 


STEP 7 
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ft SQUARE » 

• . 9629 MV* 

C<P> ■ 

IS. 99407466 


or 

sum or SQUARES 

l<CAN SQUARE 

r 

PROt»>f 

REGRESSION 

ERROR 
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1 7 

239 
246 

LfsilllfS 

4 . 22 ft« 3 S 60 

f.S«l 04 S]« 

•.•• 0654*9 

644.15 

•*•••1 


0 value 

STD ERROR 

TYPE 11 SS 

r 

PRO d»F 

INTERCEPT 
wP>SG 
DC PUS 
PS SGI 
TEMPAPR 
PtrfAPW 
PtRMAY 
PtMJUNE 

0.9 1073346 
••00047012 
-•. 0 G 9 G 273 S 
- 0.00779596 
O.Oii 256300 

LSIlHil? 

• * 0127196 ** 
0 . 0093070 ? 
•.O 02 O 14 S, 
S . 00164172 
•• 0011065 * 

r.fsiiTii 

5 . S 0647 7 lS 
i.fttf 153566 
• •01315066 
0.01916242 
0.09305561 

* 5 : 5 ! 

S 354, 26 
2.34 
2 «. 0 * 
29.26 
4. *7 

1:101 

5 :t 5 tt 

• .•Ml 
©.•Jii 

STEP & 

variable TEMP APR REmOVEO 

R SQUARE * 

0.96261737 

ctPi * 

10.42144269 


or 
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Kan SQUARE 

r 

PROb>r 

REGrESS ion 
k Pftnft 
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6 

240 

246 

4 . •6996060 

o.isdessoo 

4 . 22 SO 3 S 60 

0. 67033010 
0 . 00065*56 

1030.42 

•.•••1 
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STD ERROR 

TYPE 11 SS 

r 

PR 06 *f 

INTERCEPT 

• PSSCt 
GCPU:> 
PSSwl 
Pi PAPM 
Pt R-^AT 
PE * JU <£ 

- 0 . 01 J 16161 

c.oNAoU* 

-O.OO 007 H 72 

O. 920 a! 7 s 3 

- 0.01077311 

- 0.00771630 

0.00269903 

0.00279902 
O.OO 5 U 0 I 77 
0.01269338 
C.O 0166555 
0.00144436 
O.OOllfcfcvV 

•„ 0329606 * 
0-00201033 

3 - 52 ©] / 3 t>s 

0 . 9275525 a 
4. 01680561 
0 -# 034 o 9^2 

50 , *5 
3 . 0 > 
S 3 S 4 . 3 S 
41-04 
20.56 
5.10 

0.0301 

0 . 0 V 19 

0 - 03*1 

0.000I 

O.oafli 

J 0 


no umlp varublcs «lt T«c o.sooo significance level fop entry into ihe Sooll. 
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-14 
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*t:liTttttt 

imtmi 
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- 1 ) 
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1: vxtm 

ijmtn: 

r.nsm: 
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14.41 tStS44 

-12 


i&U 

ttiitm: 
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i-imiiit 

I4.7N717M 

O r 

-11 

'(:!tfnii! 

llllHHIl 

titmttst 

riitmm 
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titiutm 

14.71»#44»» ▼ 

-It 
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am 

Mi?::::: 
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iiitam 

IrUUmt 
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-4 

-t.tl 7444ft 

t.titttttt 
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t.tttttttt 
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1-jiJJiKl 

w • rwfftWwt 

I4.77|***|t 

/2 - 

-• 

tm 
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i::;mii 


::ui:::: 

14. 744444 M 

r 

-7 

lilitiilil 


iiii?::::: 
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-4 
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i.tt7ttttt 
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-S 
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-f atttftttt 
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-t.tl 7ttCtf 


-t.titttttt 
t . td 'ttttt 
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-t.ttt Ttttt 
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:-o: 

14.74444m 

-J 
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-t.tft Ttttt 
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-2 
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Ml 

Mimm 

litmtm 

iisurnu 
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-1 
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t 
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t.titttttt 

tilth::: 

-t.titttttt 
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-t.tttttttt 
-4 . ttt 7()t4t 

t.tttttttt 

t.titttttt 

t.tl 1 74444 
^.ttitttti 

14.74471722 

1 

iiJStttttt 

t.t)M47|j 

t.t7024|S 

trbdm 

Mftgtl 


IMKiU 


t. 274*4741 


f 


m 

i 

r j 


j 
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-i:nO 

-*.*4||S1*1 

— * • * 1 *S j » os 

0.044*4414 
-#.*•44 JS14 
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*.**S2 124# 
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4.4*1 )S*4) 
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.0.044*4*14 
-0.004* JS 10 
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MRSIt 

0.100*14)) 

-II 

teast 

-8RIS 




!:H3SU 

*.1*0*0110 

-it 

tfiSffi 

-f;M 

:|Jtt 


-!:U!UiS 

MKKIt 

0.100*4271 

-* 

*.044S?0lf 

4.4*1 ISt* 1 

3:8tittJa 


-wistsa 

lilWfttt 

0. 1004)072 

-t 


t.t4S2124< 

-*.i*lOS14J 

-*.9l*SlM 

-0.044*4*|4 

-i.004*3SI* 

•Mlffisa 

*.* 7*S*2 1 • 
*.*|40O«IS 

0.100* )00S 

-7 

-2. 114 7*4*1 
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-s 
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APPENDIX F 
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LINEARIZED MODEL AT THE COUNTY LEVEL 
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APPENDIX G 
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APPENDIX H 


OUTPUT OF COMPUTER RUN: LOGIT MODEL AT THE CRO LEVEL 
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APPENDIX J 


A LISTING OF THE 1980 EXPLANATORY UATA 



























